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^ ; I Abstract. In this paper we investigate the applicabihty of a recently introduced 

^^ ■ primal-dual splitting method in the context of solving portfolio optimization problems 

Cn , which assume the minimization of risk measures associated to different convex utility 

b^ I functions. We show that, due to the splitting characteristic of the used primal-dual 

_^ • method, the main effort in implementing it constitutes in the calculation of the prox- 

imal points of the utility functions, which assume explicit expressions in a number of 
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cases. When quantifying risk via the meanwhile classical Conditional Value- at-Risk, an 
alternative approach relying on the use of its dual representation is presented as well. 
The theoretical results are finally illustrated via some numerical experiments on real and 
synthetic data sets. 

Keywords, portfolio optimization, convex risk measure, utility function, primal-dual 
proximal splitting algorithm 



=3 ! AMS subject classification. 90C25, 90C46, 47A52 

1 Introduction and preliminaries 

>■ 

■^ ■ In financial mathematics, quantifying the risk of future random outcomes is an essential 

^> , concern for decision makers who, of course, have their own attitude towards risk. In the 



classical portfolio theory by Markowitz, the variance was used to measure the risk of 
future returns. However, subsequent developments in the theory of risk measures showed 
(^ . that the variance does not have desirable properties, one major drawback being given 

CO I by the fact that it measures deviations of the random variable in both directions, i. e. it 

penalizes losses and gains in the same way. 

When dealing with uncertainty, modern risk measures build on the wide-spread recog- 
^ I nition that asymmetry is a desirable property since investors have different positions on 

downside and upside outcomes. The first axiomatic way of defining risk measures has 
been given by Artzner, Delbaen, Eber and Heath in [1] and refers to coherent risk mea- 
sures. Nevertheless, it has become a standard in modern risk management to assess the 
riskiness of a portfolio by means of convex risk measures, introduced by Follmer and 
Schied in [17], as well as of convex deviation measures, introduced by Rockafellar, Urya- 
sev and Zabarankin in [23]. The convex deviation measures are connected with the risk 
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measures when those are apphed to the difference between a random variable and its ex- 
pectation, instead of to the random variable itself, both being designed to be used in risk 
analysis. However, deviation measures evaluate uncertainty in the form of nonconstancy, 
whereas risk measures can be understood as estimates of capital requirements for future 
net worths. 

In this paper we give a unifying framework for solving portfolio optimization problems 
assuming the minimization of a general convex risk functional subject to constraints on 
the expected return of the portfolio and on the budget. The convex risk measure in the 
objective is expressed in terms of the Optimized Certainty Equivalent (OCE), a funda- 
mental concept for quantifying risk by means of a utility function introduced by Ben-Tal 
and Teboulle in [3] (see also [4]). For the particular case of the Conditional Value-at- 
Risk an alternative approach involving its dual representation is also considered. The 
approach we propose in this paper assumes the solving of these constrained optimiza- 
tion problems, which in their majority have an nondifferentiable objective function with 
an intricate formulation, via a primal-dual proximal splitting method which has been 
recently introduced in [10]. In the last years one can notice an increasing interest in 
primal-dual algorithms when solving nondifferentiale convex optimization problems (see, 
for instance, [6,9-14,25]), as they achieve a full splitting which assumes a separate eval- 
uation of each function via its proximal points, while the occurring linear continuous 
operators and their adjoints are only evaluated via forward steps. Most primal-dual al- 
gorithms also allow inexact evaluations of the proximal points which, however, can have 
a negative impact on the overall performance. On the other hand, in a lot of applica- 
tions, as it will be also the case for majority of the convex risk measures considered in 
this paper, exact implementations of the proximal operators are possible. We refer to 
the mentioned literature for applications of the primal-dual methods in image and signal 
processing, location theory and machine learning. 

The structure of the paper is the following. In the remaining of this subsection we give 
some elements of convex analysis, present the primal-dual proximal algorithm along with 
its convergence behaviour and introduce the necessary apparatus for defining convex risk 
measures. Section 2 is dedicated to the formulation of the portfolio optimization problem 
to be solved, when the risk is quantified via the Optimized Certainty Equivalent, and 
to investigations on the applicability of the primal-dual method in this context. In 
Section 3 an alternative approach for solving the portfolio optimization problem having 
as objective the Conditional Value-at-Risk is presented. We illustrate the applicability 
of the proposed primal-dual method in the context of portfolio optimization problems in 
Section 4 by some numerical experiments on real and synthetic data. 

1.1 Convex analysis 



Let ?{ be a real Hilbert space with inner product (•, •) and associated norm \\-\\ = y/{-, •). 
The symbol M++ denotes the set of strictly positive real numbers and M-(- := M-|_+ U {0}. 
For a given set S ^ H, the function (^^ : "H — )• M := M U {±oo}, defined by Ss{x) = 
for X G 5 and 6s{x) = +oo, otherwise, denotes its indicator function. For a function 
/ : "H — )■ M we denote by dom/ := {x & 7i : f{x) < -|-oo} its effective domain and call / 
proper if dom/ ^ and f{x) > — oo for all x € H. Let be 

r('H) := {/ : "H — 7- M : / is proper, convex and lower semicontinuous}. 

The conjugate function of / is /* : 7^ — > M, f*{p) = sup{(p, x) — f{x) : x G 7i} for 
all p £ 7i and, if / G r('H), then /* G r('H), as well. The (convex) suhdifferential of 



f -.n ^Rat X Gnistheset df{x) = {p e V. : f{y) - f{x) > {p,y- x) Vy E U], if 
f{x) € M, and is taken to be the empty set, otherwise. For a linear continuous operator 
L : 7i —^ Q, the operator L* : G ^ Ti, defined via {Lx, y) = {x, L*y) for all x € H and all 
y & G, denotes its adjoint. 

Having two proper functions f,g:7i—^M., their infimal convolution is defined by 
fOg : "H — > M, (fDg){x) = inf^g-jf^ {/(y) + g{x — y)} for all x G 7i, being a convex 
function when / and g are convex. The parallel sum of the subdifferentials of / and 
g, seen as set- valued operators, is defined as dfOdg : H ^ H, {dfDdg){x) = {p G 
n : x e {df)-^{p) + {dg)-^{p)}, where {dfy^{p) = {x £ n : p e df{x)}. One has 
that {dfndg){x) C d{fng){x) for every x £ %. For / G r('H), its subdifferential 
df : Ti ^Ti is a maximally monotone operator (cf. [19]) and by Proxj(x) we denote the 
proximal point of / at x S H, representing the unique optimal solution of the optimization 
problem 



mj|/(y) + i||y-xf|. (1.1) 



yen 

For every 7 G IR++ we have Moreau's decomposition formula (cf. [2, Theorem 14.3]) 

Id = Prox^/ +7Prox.^-ij. 07-ild, (1.2) 

where Id denotes the identity operator on "H. When S* C "H is a nonempty convex and 
closed set, the proximal point oi 5s at x £ 7i is 

Prox5£,(x) = Vs{x) = argmin -\\y — x|p, 

being nothing else than the projection of x on S. 

For % and t/j, i = l,...,?7i, given real Hilbert spaces, / G r(?^), g'j,ij G r(t/j), 
i = l,...,r?T., and Li : T-L ^ Qi, i = l,...,m, nonzero linear continuous operators we 
consider the convex optimization problem 



(P) mj|/(rE) + ^(5.nZ,)(L,x)| 
and its Fenchel-type conjugate dual problem (see, for instance, [5, 10, 13]) 



{D) sup -/* - E L*v, - E i9*{v.) + n{v^)) • 

{«!,.. .,l'm)e6lX...X5„i [ \ i — l J i — l ) 

By denoting with v{P) and f (-D) the optimal objective values of the problems (P) and 
(D), respectively, in general one has weak duality, i.e., v{P) > v{D). Strong duality, 
which is the situation when v{P) = v{D) and the dual problem {D) has an optimal 
solution, holds, when some appropriate qualification condition is fulfilled. The following 
error-tolerant proximal splitting algorithm which is suitable for simultaneously solving 
the problems (P) and {D) was given in [10, Algorithm 3.1]. Its competitiveness is empha- 
sized by several numerical experiments in the context of location and image processing 
problems, also in comparison to other recently introduced iterative schemes. 

Algorithm 1.1. 

Let xq G "H, (i'i,o, • • • ) Vmfl) G ^1 X ... X Q.^ and r and cjj, i = 1, ..., m, be strictly positive 
real numbers such that 

m 



Furthermore, let {Xn)n>o be a sequence in (0,2), (a„)„>o a sequence in 7i, {bi^n)n>o and 
{di,n)n>o sequences in Qi for all i = 1, . . . , ttt. and set 

Pl^n = PrOX^/ (x„ - f TT=1 ^'"^,n) + On 

For i = 1, . . . ,m 



(Vn > 0) 



:i.3) 



Zl,n — Wi^n — 2 l^i=l ^iW2,i,n 
Xn+1 = Xn + \n{zi,n - Pl,n) 

For i = 1, . . . ,m 

Z2,i,n = PrOX^^i* {W2,i,n + ^Li{2zi^n " l^l.n)) + C^j,r; 
^'i.n+l = Vi^n + A„(z2,i,n — P2,i,n)- 



Remark 1.1. When / = 6 so}, the infimal convolution gDl is nothing else than the 
function g. In this situation, the conjugate of / is the function identical to zero and for 
all a G IK++ one has Proxg-i* = Id. 

The subsequent theorem was given in [10, Theorem 3.1] and characterizes the con- 
vergence behaviour of the sequences generated by Algorithm 1.1. 

Theorem 1.1. Suppose that there exists x ^T-i such that 

m 

G df{x)+Y,LU{dgiUdk){Ux)). 
(i) If 



:i.4) 



1=1 



X! '^nll'^^nllw < +°0' X! ^n{\\dt^n\\g, + ||&j,n||e,) < +00, i = 1, . . . ,m. 



n=0 



n=0 



and Y.n=o ^n(2 - ^n) = +00, then 

(a) {xn,vi^n, • • • 1 Vm,n)n>o Converges weakly to a point {x,vi, . . . ,Vm) £ Ti x Qi x 
... X Qfn such that, when setting 



Pl = FlOXrf U - 77 Zl ^ 



^i^i : 



anc 



i=l 

a. 



d P2,i = Prox.^^g. Vi + —Li{2pi - x) \ , i = 1, ..., m. 



Pi is an optimal solution to the primal problem (P), {P2i^- ■ ■ jP2m) ^'-^ '^^ 
optimal solution to the dual problem (D) and v{P) = v{D). 

(b) An(zi,„ - Pi,n) -)■ (n -)■ +00) and \n{z2-i,n - P2,i,n) -^ (n ^ +00) for 
i = 1, ...,m. 

(c) whenever % and Gi,i = l,...,m, are finite- dimensional Hilbert spaces, a„ — > 
(n ^ +00) andbi^n ^- (n ^ +00) fori = 1, ...,m, then (pi,n)n>o converges 
strongly to an optimal solution to (P) and {p2,i,m ■ ■ ■ ,P2,m,n)n>Q converges 
strongly to an optimal solution to (D). 



(ii) If 

+00 +00 

flldirillc- + ll&i«IL,l < +00, i = 1 m, 

n>0 



^\\an\\H<+oo, ^{\\di,n\\g, + \\bi,n\\gi) < +00, i = I,..., m, mfA„>0 

n=0 n=0 



and f and g*,i = 1, ...,m, are uniformly convex, 

then (pi,n)n>o converges strongly to an optimal solution to the primal problem (P), 
(P2,i,n) • • • ,P2,m,n)n>o Converges strongly to an optimal solution to the dual problem 
(D) 'andv{P) = v(b). 

1.2 Convex risk measures 

Let (il,5^, P) be an atomless probability space, where the elements uj oi Q represent 
future states, or individual scenarios (and are allowed to be only finitely many), ^ is 
a a-algebra on measurable subsets of Q and P is a probability measure on ^. For a 
measurable random variable X : fi — ?> M U {+00} the expectation value with respect to 
P is defined by E[X] := /^^(a;) dP(a;). Whenever X takes the value +00 on a subset 
of positive measure we have M[X] = +00. Equalities between random variables are to 
be interpreted in an almost surely (a.s.) way. Random variables X : fi — ?• R U j+co} 
which take a constant value A G M, i.e X = A a.s., will be identified with the real number 
A. Similarly, inequalities of the form X>\, X<\, X<Y, etc., are to be viewed 
in the sense of holding almost surely. By Fx we denote the distribution function of X, 
i.e. -Fx(A) = P(X < A). By taking this into account, essential supremum and essential 
infimum of a random variable X are, respectively, 

essup(X) = inf {a G M : P(X > a) = 0} = inf {a G M : X < a} 
essinf (X) = — essup(— X) = sup {a G M : X > a} . 

Each random variable X can be represented as X = X+ — X_, where Xj^, X_ are random 
variables defined via X+(a;) = max{X(c(;), 0} and X_(a;) = max{— X(a;), 0} for all a; G ^2. 
Consider further the real Hilbert space 

L^ := L^{n,d,F) = Ix -.n ^ MU l+cx)} : X is measurable, / |X(a;)p dP(a;) < +00 j 
endowed with inner product and norm defined for arbitrary X,Y £ L^ via 



{X,Y) = / X{uj)Y{uj)dF{uj) and ||X|| = {{X,X))2 = / {X{oj)y dF{uj) 

respectively. 

Definition 1.1 (Risk functions). A proper function /? : L^ — > M is called risk function. 
The risk function p is said to be 

(i) convex, if p{XX + (1 - X)Y) < \p{X) + (1 - \)p{Y) for ah A G (0, 1), X, Y G L^; 

(ii) positively homogeneous, if p(0) = and p{XX) = Ap(X) for all A G IR++, X G L^; 

(iii) monotone, if X > y implies p{X) < p{Y) for all X,Y G L^; 

(iv) cash-invariant, if p(X + c) = p(X) — c for all c G M, X G L^; 

(v) a convex risk measure, if p is convex, monotone and cash-invariant; 



(vi) a coherent risk measure, if p is a positively homogeneous convex risk measure. 

Axioms for coherent risk measures were first given in the Uterature by Artzner, Del- 
baen, Eber and Heath in [1], while later one, Follmer and Schied considered in [17] the 
convex risk measures, by replacing the sublinearity with the weaker assumption of con- 
vexity. More precisely, when the value p{X) is understood as a capital requirement for 
the future net worth X, a convex risk measure guarantees that the capital requirement 
of the convex combination of two positions does not exceed the convex combination of 
the capital requirements of the positions taken separately. For properties and examples 
of coherent and convex risk measures we refer to [1,4,7,15,16,18,20-23]. 

In our investigations a central role will be played by a generalized convex risk measure 
associated to the so-called Optimized Certainty Equivalent, which was introduced for 
concave utility functions in [3] and adapted to convex utility functions in [7]. For the 
utility functions considered throughout this paper we make the following assumption. 

Assumption 1.1 (Convex utility function). Let u : M ^ M be a proper, convex, lower 
semicontinuous and nonincreasing function such that n(0) = and —1 € du{0). 

In the literature the two conditions imposed on u are known as the normalization 
conditions and are equivalent to u(0) = and u{t) > —t for all t E M. The generalized 
convex risk measure we use in order to quantify the risk was given under the name 
Optimized Certainty Equivalent (OCE) in [4] and is defined as (see, also, [7]) 

p„ : L^ ^ M U {+oo}, puiX) = inf {A + E [n(X + A)]} . (1.5) 



By Assumption 1.1, it follows that Pu{X) > — E [X] for every AT G L^ and that pu fulfills 
the requirements of being a convex risk measure. 

2 Solving a general portfolio optimization problem 

Consider a portfolio with a number of A^ > 1 different positions with returns Ri € L^, 
i = 1, . . . ,N, a nonzero vector of expected returns p = (E [Ri] , . . . , E [R^])'^ and p* < 
maxj=i^...^Ar E [Ri] a given lower bound for the expected return of the portfolio. In this 
section we discuss the employment of Algorithm 1.1 when solving for different convex 
utility functions the optimization problem 

/ N \ 

" (2.1) 




which assumes the minimization of the risk of the portfolio subject to constraints on the 
expected return of the portfolio and on the budget. Here, 1 denotes the vector in R 
having all entries equal to 1. By using (1.5), we obtain the following reformulation of the 
problem (2.1) 



inf <^ A + E 

= (a:i,...,Xiv)^eM:^, AgM 



■ / N ^ 

u ^ XiRi + A 



(2.2) 



which will prove to be more suitable for being solved by means of the primal-dual prox- 
imal splitting algorithm presented in the previous section. In this sense, the following 
result, which relates the optimal solutions of the two optimization problems is of certain 
importance. 

6 



Proposition 2.1. The following statements are true. 

(a) If (x, A) is an optimal solution to (2.2), forx = {xi, . . . ,xn)'^ , then x is an optimal 
solution to (2.1). 

(b) Ifx = (xi, . . . ,xnY' is an optimal solution to (2.1) and 

/ N 



A G arg inin < A + E 

A6R I 



U 



^ XiRi + A 



\i=l 



then (x, A) is an optimal solution to (2.2). 
Proof. Denote by ;f = |x € M+ : x^/i > /i*, x^l^ = l}. 
(a) Since (x, A) € -^ x M is an optimal solution to (2.2), we have for every (x, A) € X ^. 



A + E 



N 



U 



^ XiRi + A 



^i=l 



> A + E 



N 



^ XiRi + A 



u=l 



> pu i^^XiRij 



Passing to the infimum over A G M yields 



Pu f Yl ^i^i ) ^ A" ( X! ^i^i ) '''^ ^ -^^ 



hence, x G ^ is an optimal solution to (2.1). 
(b) The conclusion follows by noticing that for every (x, A) G r^" x M we have 

N \-\ / N \ / N 



A + E 



M / \ / 

Y^ XiRi + A I > PuiY ^i^i ) ^ P^iY '^i^i 



N 



= A + E 

Remark 2.1. A sufficient condition guaranteeing that 

arg min {A + E [-u (X + A)]} / VX G L^ 



u 



Y XiRi + A 



u=l 



D 



was given in [7, Theorem 4] and reads 

{(i G M : Uoo{d) = -d] = {0}, 



(2.3) 



where Uoo : K ^ M, Uoo{d) = sup{n(x + d) — u{x) : x G domu}, denotes the recession 

function of the function u. Moreover, in the light of the same result, it follows that under 

(2.3) 

Pu{X)= sup {{X,E)-E[u*{E)]}\fXeL^ 
HeL2 

E(H)=-1 

thus Pu is lower semicontinuous. Since X is compact, this further implies that (2.1) 
has an optimal solution and, consequently, that (2.2) has an optimal solution, too. All 
particular convex utility functions we deal with in this paper fulfill condition (2.3). 



According to Proposition 2.1, determining an optimal solution to problem (2.2) will 
lead to an optimal solution to the portfolio optimization problem (2.1). However, as we 
will show in the following, problem (2.2) is a particular case of the problem (P), thus it 
can be solved by Algorithm 1.1, but also by some other primal-dual proximal splitting 
methods. In order to show this, let us first consider the linear (hence continuous) operator 

N 
i=l 

In order to determine its adjoint operator K* : L^ -^ M^ x M we use that 

{K{x, A), Z)= J Ifl XiR^{^) + ^ ) Z{lo) dV{Lo) =Y.Xi {Ri, Z) + \ (1, Z) 

= {ix,\),K*Z) 
for all (x. A) G M^ X M and all Z £ L^ and get 

K*z = {{Ri,z),...,{RN,z),E[z]f yz eL\ 

Further, by considering the convex and closed sets 

r={xGM^:x^l^ = l}, 
the optimization problem (2.2) can be equivalently written as 

inf {5^N{x) + X + 5sMx,X)+STxRix,X) + (E[u]oK){x,X)}. (2.4) 

It is obvious that the functions (x. A) i-^ 6^n{x) + A, 6sxR and 5txR are proper, 
convex and lower semicontinuous. Furthermore, in the light of Assumption 1.1 and by 
using Fatou' lemma, it follows that E [u] has these properties, as well. This means that 
problem (2.4) fits into the formulation of the problem (P). 

Remark 2.2. For utility functions fulfilling Assumption 1.1 and condition (2.3), we have 
already seen that the optimization problem (2.4) has an optimal solution. Due to the 
fact that a Slater- type qualification condition is fulfilled, from here it follows (see [5]) 
that condition (1.4) in Theorem 1.1 holds. 

By having a closer look into the formulation of Algorithm 1.1, one can notice the 
exposed role played by the proximal points of the functions occurring in the objective of 
the problem to be solved. Having these determined, one can easily obtain via (1.2) the 
proximal points of their conjugates, when needed. It is an easy calculation to see that 

for (x. A) G M^ X M and 7 G M++ it holds 

Prox^/(x,A)= argmin \-fi^ + -\\{y,u) - {x,X)f \ = (v^n (x) , X- 'j) , 

with 

/ : M^ X M ^ 1, /(y, i/) = 5^N{y) + u, 



Prox^5sxE(^' A) = {Vs {x) , A) and Prox^5j,^g(x, A) = (Vt (x) , A) , 
where (see, for instance, [2, Example 28.16 and Example 3.21]), 

f X, if x'^/X > /i* 1 _ x'^1^ ^^ 

^-(^) = . + ^^^^, otherwise ^^^ ^^^^^ = ^ + ^^^ 

^ 11/^11 

As we show below, in order to determine the proximal points of E [u] one needs more 
intricate arguments. 

Proposition 2.2. For arbitrary random variables X £ L^ and 7 G IR++ it holds 

Prox^lg[u](X)(a;) = Prox^^j {X{uj)) Vo; € 17 a. s.. (2-5) 

Proof. We have 

Prox^EH(^) = argmin (7E [n(y)] + ^ ||y - Xf] 

= argmin (7 / u{Y{oj)) dP(w) + - / {Y{oj) - X{uj)f dP(w) 
ygL2 L "'n 2 Jn 



argmin / (-fu{Y{uj)) + -(y(c^) - X{oj))A dF{oj). 
YGL'2 Jn\ 2 J 



Hence, using the interchangeability of integration and minimization (see [24, Theorem 
14.60]), we have 

Prox^Er„i(X)(a;) = argmin I -fu{y) + - {y - X{uj)f \ = Prox^„ {X{uj)) Vw € fi a. s.. 
yeR I ^ J 

D 

In what follows we provide explicit formulae for the proximal points of some popular 
convex utility functions considered the literature, which will be of importance for the 
numerical experiments presented in the last section and which involve the convex risk 
measures which rely on them. 

2.1 Piecewise linear utility 

For 72 < — 1 < 71 < we consider the piecewise linear utility function 

ui:M^M, ui(t) = | ;^jj| ![J^° =j,[t]^-j,[t]_. 

Assumption 1.1 is fulfilled since ui(0) = and —1 G dui{0) = [72,71] and, since for all 
deR (see [7]) 

{72d, if d < 0, 
0, if d = 0, 
7id, if d > 0, 

condition (2.3) is fulfilled, as well. Hence, ui gives rise to the lower semicontinuous 
coherent risk measure 

p„,(X) = inf fA + 7iE[X + A]+-72E[X + A]_} VX G L^. (2.6) 



For every 7 G M_|_+ and t G M it holds 

Prox^„i (t) = argmin <^ 7 (71 [s]^ - 72 [s]_) + - (s - t)H 

t-772, ift<772 

0, if t G [772,771] 

t-771, if i > 771 

= [t-77i]+- [i-772]-- 

When setting 71 = and 72 = — ^3^ for some a G (0, 1), the convex risk measure (2.6) 
becomes the classical so-called Conditional Value-at-Risk at level a (see, for example, 

[20,21]) 

CVaR„ ■.L'^^R, CVaR„(X) = inf (a + ^— E [X + A]_l . (2.7) 

AeR 1^ 1 — a J 

The infimum in the in the expression of the Conditional Value-at-Risk is attained for 
every X G L^ at the so-called Value-at-Risk at level a, i.e., 



VaR„(X) = argmin \x + —^E [X + A] 
AeM I I- a 



2.2 Exponential utility function 

Consider the exponential utility function n2 : M — t- M, U2{t) = exp(— t) — 1. It fulfills 
Assumption 1.1 and, since (^2)00 = ^[c+oo), condition (2.3) is fulfilled, as well. It gives 
rise via (1.5) to the so-called entropic risk measure 

Pu2{X) = inf {A + E[exp(-X - A) - 1]} VX G L^ (2.8) 



which is a lower semicontinuous convex risk measure. For arbitrary 7 G IR++ and t G M 
it holds 

Prox^„2 (i) = argmin <^ 7(exp(-s) - 1) + - (s - t)^ [ 

Although no closed form expression for the proximal points of 7U2 can be given, these can 
be efficiently calculated by applying Newton's method under the use of previous iterates 
as starting points. 

2.3 Indicator utility function 

By choosing the utility function M3 : M — )• M, u^{t) = '^[o,+oo)(0) oiie has (1x3)00 = (^[c+oo), 
thus, both Assumption 1.1 and condition (2.3) are fulfilled. It gives rise to the so-called 
worst-case risk measure 

Pu;{X) = inf A = -essinf X = essup(-X) VX G L^, (2.9) 

X+X>0 

which is a lower semicontinuous convex risk measure. For arbitrary 7 G R++ and t G M 
it holds 

Prox^^,3 it) = argmin <^ 7'^[o,+oo)(s) + i:{s-tf\ = P[o,+oo)(i)- 
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2.4 Quadratic utility function 

For a fixed (3 € M_|__|_ we consider the quadratic utility function 

Obviously, (ti4)oo = <^[o,+oo)! thus, both Assumption 1.1 and condition (2.3) are also 
fulfilled for this utility function. For arbitrary 7 € K++ and t € M, it holds 

1' -^^l.f 1^, ift<^ 



Prox^„4 (t) = argmin \ 7^4(5) + - (s - i) 

s6R L ^ 



t, if * > ;5 



2.5 Logarithmic utility function 

For 9 G M_|_+, we consider the logarithmic utility function 

n5:M^M, n5(t) = |^^^^ '' if t < -^ • 

For this special utility function, one can also show that (m5)oo = '5[o,+oo)) hence that (2.3) 
is fulfilled. The properties in Assumption 1.1 hold as well and therefore, via (1.5), we 
obtain the convex risk measure 

Pu,{X) = mf [\ - 0E [in (l + ^i^) 1 1 VX G L\ 
x+\>-e 

The proximal points of the logarithmic utility function take an explicit expression. For 
arbitrary 7 G IR++ and t G M, it holds 



Prox^,, (i) = argmin |-701n {l + '-^ +]^{s -tf^ = ^ + W ^^ ^^ + ^(7 + t) 



s>-6 



3 An alternative approach for CVaR 

In this section we propose an alternative approach for solving the portfolio optimization 
problem (2.1) when the risk measure in the objective is the Conditional Value-at-Risk 
at a given confidence level a G (0, 1). To this aim we work in a discrete probability 
space with il finite, which is the natural framework in real-life applications. We denote 
by \Vt\ the cardinal of the set Vi. Thus, the probability measure P can be represented 
as a vector (pi, ...,P|n|) € M'^' with Pi > 0,i = l,...,|ri|, and YJi=iPi = 1 ^-iid the 
space of random variables L^ can be identified with the finite-dimensional space RI^L 
The investigations made in this section rely on the following dual representation of the 
Conditional Value-at-Risk given in [18,22], namely, for every X G M'^' it holds 



where 



CVaRa(A:) = sup -q^ X, (3.1) 



Q=<igGMl^l:5]g. = 1, {)<qi<r^, i = 1, . . . , |0| ^ . (3.2) 



11 




By introducing the convex and closed sets 

U = <^xGMI^I :^x, = 1 

V ={x& M'^' : < X, < ^^, i = l,...AVL\ 
L 1 — a 

we obtain Q = U r\V , hence, for every X € M'^' it holds 

CVaR„(X) = 5*q{-X) = 5*jj^y{-X) = {5u + 5vT {-X) = {5*uU5*y) (-X), (3.3) 

where the last equality follows from [2, Theorem 15.3 & Proposition 15.5] and the fact 
that the intersection of the relative interiors of the sets U and V is nonempty. 
Thus, for given Ri G Ml^l,i = 1, ■■■,N, the portfolio optimization problem 

_ (3.4) 

can be equivalently written as 

inf {5^N{x)+5s{x) + 5T{x) + {51jU5*y){Rx)], (3.5) 

where S and T are the sets already introduced in the previous section and R : M^ -^ W^' 
is defined as R{xi, ...,xn) = — Z)j=i ^iRi- 

One can easily notice that the optimization problem (3.4) fits in the formulation of the 
general convex optimization problem (P), all the extended real- valued functions present 
in its objective being proper, convex and lower semicontinuous, and thus it can be solved 
by means of Algorithm 1.1. We would also like to point out that the primal-dual splitting 
algorithms proposed in [13, 25] are also designed to solve convex optimization problems 
involving infimal convolutions, however, they cannot be applied in this situation. This is 
because they require that one of the two functions occurring in the infimal convolution are 
strongly convex, which is for the problem (3.5) not the case. For the implementation of 
Algorithm 1.1 one has only to determine the projections on some simple convex and closed 
sets, for which one actually has explicit expressions. We would also like to emphasize 
that, from this point of view, it is p referable to work with the sets U and V separately, 
instead of dealing with their intersection Q. 

Remark 3.1. The approach described above can be analogously employed when consid- 
ering portfolio optimization problems having as objective a weighted sum of Conditional 
Value-at-Risk functionals taken at different levels of confidence. 

4 Numerical experiments 

4.1 Simulated data 

The first numerical experiments we made followed the scope to compare different ap- 
proaches for solving the portfolio optimization problem which assumes the quantification 
of risk by means of the Conditional Value-at-Risk. More precisely, we compared the 
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performances of Algorithm 1.1 when appHed in the context of the approaches proposed 
in the sections 2 and 3, but also with the linear programming approach, widely used in 
this context in the literature. To this end we used synthetic data obtained by creating 



\n\ 



,1 



1, ....,N, where N represents the number of assets in the 



random returns Ri G 
portfolio. 

We first solved with the Matlab routine linprog the reformulation of (3.4) as a 
linear program, that can be easily obtained by means of (2.7). Then we used the primal- 
dual method given in Algorithm 1.1 to solve (3.4) via two different approaches, namely, 
on the one hand, by solving the reformulation (2.4) proposed in Section 2 and, on the 
other hand, by solving the reformulation (3.5) given in Section 3 and relying on the dual 
representation of the objective. We terminated the algorithms when subsequent iterates 
start to stay within an accuracy level of 1 % with respect to the set of constraints and 
to the optimal objective value reported by the linear programming solver. Within these 
examples we used the confidence level a = 0.95. The algorithms were implemented in 
Matlab on an Intel Core 15-2400 processor under Windows 7 (64 Bit). 



\n\ 


N 


LP (linprog) 


OCE (iterations) 


DR (iterations) 


1000 
1000 
1000 


100 

500 

1000 


1.44s 
11.81s 
29.11s 


0.07s(250) 
3.45s(1078) 

12.85s(1772) 


0.06s(247) 
1.66s(519) 
3.94s(546) 


10000 
10000 
10000 


100 

500 

1000 


27.18s 
248.79s 
505.66s 


1.89s(252) 
38.76s(1087) 
174.05s(2465) 


1.36s(185) 
12.48s(351) 
27.79s(394) 



Table 4.1: CPU times in seconds for solving the portolio optimization problem when using the 
linear programming (LP) approach, the Optimized Certainty Equivalent (OCE) approach and 
the dual representation (DR) approach. 

The computational results presented in Table 4.1 show that both approaches proposed 
in this paper outperformed to linear programming approach from the point of view of the 
time needed to determine an optimal portfolio for different choices of A'^ and 1 17 1 . When 
running Algorithm 1.1 we used the following parameters: 



OCE: di 

DR {\n\ : 

A = 1.99; 

DR (\n\ 



50, (72 = 50 
1000): ai = 



as = 70/ \\K 

2, £72 = 2 



L2,T = 3/{ai -F 0-2 + 0-3 ||A'||^2 
£73 =0.1/||ii||^2,r 



),A 



1.99; 

|2 



2/{ai + a2 + a3\\R\\i2), 
10000): 0-1 = 0.1, cr2 = 0.1, (73 = 0.001/ ||i2||^2, r = 2/{ai + a2 + 



a3\\R\\i2),X = l.99. 

4.2 Real data 

For the experiments described as follows we took weekly opening courses over the last 
13 years from assets belonging to the indices DAX and NASDAQ in order to obtain the 
returns Ri G ]Rl"l,i = 1,...,A^, for \n\ = 689 and N = 106. The data was provided 
by the Yahoo finance database. Assets which do not support the required historical 
information like Volkswagen AG (DAX) or Netfiix, Inc. (NASDAQ) were not taken into 
consideration. 

We solved the portfolio optimization problem (2.1) by taking as objective function 
the corresponding convex risk measures induced by the linear, exponential, indicator. 
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quadratic and logarithmic utility function. More precisely, we solved with Algorithm 
1.1 its equivalent reformulation (2.1) and used to this end the formulae for the proximal 
points of each utility function given in Section 2. The values of the expected returns 
associated with Ri, i = 1, ..., N ranged from —0.2690 (Commerzbank AG, DAX) to 1.4156 
(priceline.com Incorporated, NASDAQ). 

Table 4.2 collects some computational results when computing by using Algorithm 
1.1 optimal solutions for the five utility functions. The stopping criterion was the same 
as in Section 4.1. Table 4.2 shows that Algorithm 1.1 in combination with the worst-case 
risk measure, i.e., the one induced by the indicator utility function, performed poorly 
on the given dataset. It also shows that the algorithm is sensitive with respect to the 
lower bound of the expected return fi* . When calculating the proximal points of the 
exponential utility function we used five iterations of Newton's method with previous 
iterates as starting points to obtain an appropriate approximation. 



f^* 


linear (a = 0.95) 


exponential 


indicator 


quadr. {(3=1) 


log. {9 = 5) 


0.3 


0.14s(500) 


0.18s(402) 


- (> 15000) 


0.05s(170) 


0.53s(1891) 


0.5 


0.15s(520) 


0.15s(336) 


- (> 15000) 


0.06s(196) 


0.38s(1335) 


0.7 


0.33s(1202) 


0.31s(682) 


- (> 15000) 


0.06s(186) 


0.72s(2570) 


0.9 


0.32s(1164) 


0.40s(885) 


- (> 15000) 


0.08s(272) 


1.07s(3820) 


1.1 


0.41s(1526) 


6.80s(15222) 


- (> 15000) 


0.14s(486) 


1.18s(4198) 


1.3 


0.42s(1570) 


5.45s(12155) 


- (> 15000) 


0.41s(1476) 


6.61s(23547) 



Table 4.2: CPU times in seconds and the number of iterations when solving the portfolio opti- 
mization problem (2.1) under different utility functions. 

Figure 4.1 shows the efficient frontiers for problem (2.1). When using the indica- 
tor utility function we stopped the algorithm after a number of 30000 iterations. The 
objective value, however, still oscillated in this scenario. 
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(a) linear utility 



(b) exponential utility 



risk 




(c) indicator utility 



risk 



(d) quadratic utility 



risk 



(e) logarithmic utility 



risk 



Figure 4.1: The efficient frontiers for the portfolio optimization problem under different convex 
risk measurements. 
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